Dialogue focus tracking for zero pronoun resolution
نویسندگان
چکیده
We take a novel approach to zero pronoun resolution in Chinese: our model explicitly tracks the flow of focus in a discourse. Our approach, which generalizes to deictic references, is not reliant on the presence of overt noun phrase antecedents to resolve to, and allows us to address the large percentage of “non-anaphoric” pronouns filtered out in other approaches. We furthermore train our model using readily available parallel Chinese/English corpora, allowing for training without hand-annotated data. Our results demonstrate improvements on two test sets, as well as the usefulness of linguistically motivated features.
منابع مشابه
RAFT/RAPR and Centering: A Comparison and Discussion of Problems Related to Processing Complex Sentences
Several researchers have noted the local coherence exhibited by discourse (Sidner 1979; Grosz, Joshi, and Weinstein 1983; Carter 1987; etc.). A primary component of this local coherence is the way the local focus of the discourse shifts from one sentence to the next and the way this shifting is marked by linguistic choices made by the writer/speaker. By local focus, we refer to that concept a s...
متن کاملA Machine Learning Approach to Pronoun Resolution in Spoken Dialogue
We apply a decision tree based approach to pronoun resolution in spoken dialogue. Our system deals with pronouns with NPand non-NP-antecedents. We present a set of features designed for pronoun resolution in spoken dialogue and determine the most promising features. We evaluate the system on twenty Switchboard dialogues and show that it compares well to Byron’s (2002) manually tuned system.
متن کاملThe DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems
We describe the DARE corpus, an annotated data set focusing on pronoun resolution in tutorial dialogue. Although data sets for general purpose anaphora resolution exist, they are not suitable for dialogue based Intelligent Tutoring Systems. To the best of our knowledge, no data set is currently available for pronoun resolution in dialogue based intelligent tutoring systems. The described DARE c...
متن کاملAntelogue: Pronoun Resolution for Text and Dialogue
Antelogue is a pronoun resolution prototype designed to be released as off-the-shelf software to be used autonomously or integrated with larger anaphora resolution or other NLP systems. It has modules to handle pronouns in both text and dialogue. In Antelogue, the problem of pronoun resolution is addressed as a two-step process: a) acquiring information about properties of words and the entitie...
متن کاملA Probabilistic Method for Analyzing Japanese Anaphora Integrating Zero Pronoun Detection and Resolution
This paper proposes a method to analyze Japanese anaphora, in which zero pronouns (omitted obligatory cases) are used to refer to preceding entities (antecedents). Unlike the case of general coreference resolution, zero pronouns have to be detected prior to resolution because they are not expressed in discourse. Our method integrates two probability parameters to perform zero pronoun detection ...
متن کامل